Skip to main content

Chat Models

OrganizationModel NameAPI Model StringContext lengthQuantization
OpenAIGPT OSS 120Bopenai/gpt-oss-120b128000MXFP4
OpenAIGPT OSS 20Bopenai/gpt-oss-20b128000MXFP4
DeepSeekDeepSeek R1 Distill Llama 70Bdeepseek-ai/deepseek-r1-distill-llama-70b65000FP16
Mistral AIMistral (7B) Instruct v0.3mistralai/Mistral-7B-Instruct-v0.332768FP16
NVIDIANemotron Orchestrator 8Bnvidia/Orchestrator-8B16384FP16
NVIDIANemotron 3 Nano 30Bnvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16262144BF16
MicrosoftFara 7Bmicrosoft/Fara-7B8192FP16
MetaLlama 3.3 70B Instructmeta-llama/Llama-3.3-70B-Instruct8192FP16

Code Models

OrganizationModel NameAPI Model StringContext lengthQuantization
QwenQwen3 Coder 30B A3B InstructQwen/Qwen3-Coder-30B-A3B-Instruct131000FP16

Image Models

OrganizationModel NameAPI Model StringModel TypeDefault steps
Pruna AIP-Imagep-imageImage Generation
Pruna AIP-Image LoRAp-image-loraImage Generation
Pruna AIP-Image Editp-image-editImage Edit
Pruna AIP-Image Edit LoRAp-image-edit-loraImage Edit
Qwen Tongyi MAIZ Image TurboTongyi-MAI/Z-Image-TurboImage Generation9
Stability AIStable Diffusion 3.5 Largestabilityai/stable-diffusion-3.5-largeImage Generation30
QwenQwen Image EditQwen/Qwen-Image-EditImage Edit20

Audio Models

OrganizationModalityModel NameAPI Model String
OpenAISpeech-to-TextWhisper Large v3openai/whisper-large-v3

Video Models

OrganizationModel NameAPI Model StringMax DurationMax Resolution
Pruna AIP-Videop-video10 seconds1080p

OCR Models

OrganizationModel NameAPI Model StringContext length
TencentHunyuan OCR (1B)tencent/HunyuanOCR16000

Vision Models

OrganizationModel NameAPI Model StringContext length
QwenQwen3-VL 8B InstructQwen/Qwen3-VL-8B-Instruct32768
QwenQwen3-VL 30B A3B InstructQwen/Qwen3-VL-30B-A3B-Instruct128000
QwenQwen3.5 397B A17BQwen/Qwen3.5-397B-A17B256000
QwenQwen3.5 122B A10BQwen/Qwen3.5-122B-A10B256000
QwenQwen3.5 27BQwen/Qwen3.5-27B256000
QwenQwen3.5 35B A3BQwen/Qwen3.5-35B-A3B256000
QwenQwen3.5 FlashQwen/Qwen3.5-Flash1000000

Embedding Models

Model NameAPI Model StringModel SizeEmbedding DimensionContext Window
BGE-Large-EN-v1.5BAAI/bge-large-en-v1.5326M1024512